[Docs] Document gRPC mode for PD disaggregation #3968

Merged
Bihan merged 4 commits into dstackai:master from Bihan:add_gRPC_smg_docs on Jun 29, 2026

@Bihan (Collaborator) commented Jun 15, 2026

Documents the gRPC worker communication mode for PD disaggregation, added in #3946.

  • The concept docs (services.md) keep the simple HTTP example and state the two worker modes (HTTP/gRPC) in one line.
  • The full gRPC config lives in a collapsed "gRPC mode" note on the SGLang and vLLM example pages.

SGLang workers support both HTTP and gRPC; vLLM workers support gRPC only.

@Bihan changed the title from "Add g rpc smg docs" to "Add gRPC smg docs" on Jun 15, 2026
@Bihan requested a review from peterschmidt85 on June 15, 2026 15:37
Keep services.md on the simple HTTP example, state the two worker
communication modes (HTTP/gRPC) in one line, and move the gRPC config
into a collapsed "gRPC mode" note on the SGLang and vLLM example pages.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@peterschmidt85 changed the title from "Add gRPC smg docs" to "[Docs] Document gRPC mode for PD disaggregation" on Jun 29, 2026

@peterschmidt85 (Contributor) left a comment

Approving. I pushed a few changes to simplify the docs: kept the concept page on the simple HTTP example, stated the two worker modes (HTTP/gRPC) in one line, and moved the full gRPC config into collapsed "gRPC mode" notes on the SGLang and vLLM example pages.

One thing to double-check: the gRPC examples omit model:/probes: while the HTTP ones keep them — consistent, but worth confirming it's correct for gRPC workers.

Comment thread mkdocs/docs/concepts/services.md Outdated
Comment thread mkdocs/docs/examples/inference/sglang.md Outdated
Address review feedback: "SMG worker image" was ambiguous. Note that
gRPC workers run from SMG images bundling a specific backend version.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@Bihan merged commit 50805c7 into dstackai:master on Jun 29, 2026
24 checks passed